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ABSTRACT 

Weak lensing by large-scale structure is an invaluable cosmological tool given that most 
of the energy density of the concordance cosmology is invisible. Several large ground- 
based imaging surveys will attempt to measure this effect over the coming decade, but 
reliable control of the spurious lensing signal introduced by atmospheric turbulence 
and telescope optics remains a challenging problem. We address this challenge with a 
demonstration that point-spread function (PSF) effects on measured galaxy shapes in 
current ground-based surveys can be corrected with existing analysis techniques. In 
this work, we co-add existing Sloan Digital Sky Survey imaging on the equatorial stripe 
in order to build a data set with the statistical power to measure cosmic shear, while 
using a rounding kernel method to null out the effects of the anisotropic PSF. We build 
a galaxy catalogue from the combined imaging, characterise its photometric properties, 
and show that the spurious shear remaining in this catalogue after the PSF correction 
is negligible compared to the expected cosmic shear signal. We identify a new source 
of systematic error in the shear-shear auto-correlations arising from selection biases 
related to masking. Finally, we discuss the circumstances in which this method is 
expected to be useful for upcoming ground-based surveys that have lensing as one of 
the science goals, and identify the systematic errors that can reduce its efficacy 
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1 INTRODUCTION 

Modern cosmologists can simulate the invisible implications 
of modern cosmological models (e.g., those t hat can explain 
the c osmic microwave background, incIuding lKomatsu et alj 
2011) to what is generally agree d to be a high level of preci- 
sion (and probably accuracy, c.f. lLawrence et alJl20ld V The 
easily observable consequences of these models for observa- 
tions of galaxies are not so easy t o calculate (e.g. Rudd et al.l 
l20Qg| ; IConrov fc Wechslerll2009l ; ISimha et al]|201Ch . invohX 
ing as they do the physics of the familiar but nevertheless 
stubbornly complicated baryons. Most of the precisely cal- 
culable components of these models - namely, the properties 
of the distribution of dark matter on large scales in relatively 
linear structures - are not readily observable. 

For the foreseeable future, the most direct observation 
of these dark components is the measurement of the gravi- 
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tational effects of dark structures on the images of distant 
background galaxies. These measurements are made almost 
exclusively via statistical estimation of the distortions in the 
cllipticities of background galaxies. This takes advantage of 
the fact that galaxies have no preferred orientation in a ho- 
mogeneous, isotropic universe. 

Lensing measurements have played a significant role in 
observational astrophysics in the last two decades, over a 
range of scales and physical regimes. Studies of galaxy evo- 
lution benefit from the ability t o understand the dark mat 
ter halos that host ga la xies (e.g. Hoekstra et alj 20041 . 2005 : 



Hev mans et al.l 



20061: iMandelbaum et all l2006al lbl. 



2009; 



Leauthaud et al. 201 ll ). Cosmologists have no other way to 
directly map the large-scale matter distribution, which is 
crucial for c onstraining mode l s of dark energy and modi- 
fied gravity (|Zhang et al.ll2007l : iReves et alj|2010l h On small 
scales, maps of the matter distribution can be tied directly 
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to tests of the cold dark matter paradigm and simulations 
of the formation and evolution of dark matter halos. 

Much has been made of the scientific potential of this 
technique. Five years ago, we ak lensing was ident ified by 
the Dark Energy Task Force jAlbrecht et alj 120061 ) as the 
most promising tool for constraining cosmological models. 
Several large ground-based and space-based survey propos- 
als place a weak lensing measurement among their primary 
science drivers, including the Panoramic Survey Telescope 
and Rapid Response System (Pan-STARRS jj, the Dark 
Energy Survey (DE Sffl the Hyper Suprime-Cam (HSC, 
iMivazaki et~afl l2006h survey, the Large Synoptic Survey 
Telescope (LSSTfl EucliolJand the Wide-Field Infrared 
Survey Telescope (WFIRSTfl 

For all the promise, the technical challenges for these fu- 
ture experiments remain formidable. An order-unity distor- 
tion to background galaxy images is produced by a physical, 
projected matter overdensity of 

_ c 2 d s 
ljctlt - 4nGd L d LS ' W 

where d_L, ds, and g(ls are the angular-diameter distance 
from the observer to the lens and source, and from the lens 
to the source, respectively. For characteristic distances of ap- 
proximately a Gpc, the critical surface density is 0.1 g cm -2 . 
Typical fluctuations in the matter density field projected 
over cosmological distances are a thousand times smaller 
than this, so order 10 Mpc-scale density fluctuations in the 
universe will typically produce changes in galaxy elliptici- 
ties of order e « 10 -3 to 10~ 2 in magnitude. In the shot- 
noise dominated regime, the leading-order contribution to 
the variance in the correlation function of the ellipticity dis- 
tortions is 



Var(&) = 



N 2 . ' 

pair 



(2) 



For a shallow ((z) = 0.5) galaxy survey with shape noise 
due to random galaxy ellipticities a e ~ 0.3 and 100 deg 2 
of sky coverage, reducing the shot noise contribution below 
the expected cosmological signal requires a surface density of 
usable source galaxies of at least a few per square arcminutc. 

Worse, for ground-based imaging surveys, the observed 
shape distortions arising from atmospheric turbulence and 
optical distortions from the telescope are typically of order 
several percent, with coherence over angular scales compa- 
rable to that of the lensing shape distortions. A competi- 
tive measurement of the amplitude of matter fluctuations 
requires suppressing or modeling these coherent spurious 
distortions to of order one part in 10 3 , and future surveys 
will need to do a factor of several better. 

Achieving both the statistical precision and control 
of systematic errors that is required for such a measure- 
ment has proved to be a challenge. The early detections 
jBacon et al.ll2000l;IVan Waerbeke et aUlgOOd: [Rhodes et al ' 



2001 



2003) 



iHoekstra et al.ll2002i ; iBrown et al.l 120031 ; I Jarvis et al 
showed the promise of the method and confirmed the 
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existence of lensing by large-scale structure at roughly the 
expected level. However, they also highlighted some of the 
systematic errors: in particular, B-mode shear (which can- 
not be produced by lensing at linear order and is thus indica- 
tive of systematic effects) was present at a sub-dominant but 
non- negligible level. Since then, the weak lensing commu- 
nity has moved in the direction of both deep/narrow surveys 
with the Hubble Space Telescope (HST) and wide/shallow 
surveys on the ground. The Cosmological Evolution Survey 
(COSMOS) is the pre mier example of the f ormer: in addition 
to 2- point statistics (|Massev et al.l 120071 : ISchrabback et all 
l2010h . it has also pr oduced three-dimen sional maps of the 
matter distribution (M assev et alj|2007f) and the lensing 3- 
point correlation function (|Semboloni et alj|201lh . Excellent 
control of lensing systematics in COSMOS was also achieved 
thanks to the small number of d egrees of freedom co ntrolling 
the PSF (mostly focus variation: iRhodes et al.|[2007l) and de- 
tailed modeling of charge transfer inefficiency l|Massev et al.l 
l2010h . However, COSMOS covers only 1.6 deg 2 , and the 
small field of view of HST instruments makes significantly 
larger surveys impractical. The principal recent ground- 
based cosmic shear programme has been the Canada-France- 
Hawaii Telescope Legacy Survey (CFHTLS). There are now 
several cosmic shear results from di f ferent subsets of the 
CFHTLS data JSemboloni et al .1120061 ; IHoekstra et alj|200d : 
iBeniamin et al.ll2007l ; iFu et alj|2008h . and the CFHT lens- 
ing team is completing a reanalysis using recent advances in 
PSF determination and galaxy shape measurement. 

In light of the efforts shortly to be made by large, ex- 
pensive surveys to measure cosmic shear, we consider it im- 
perative to show that such a measurement can be performed 
accurately, without significant contaminating systematic er- 
rors, from a ground-based observatory. This goal includes 
doing a cosmic shear measurement with each of the wide- 
angle optical surveys that presently exist. To this end, we 
have re-coadded the repeat observations on the equatorial 
stripe (stripe 82) of the Sloan Digital Sky Survey (SDSS), 
using methodology that will optimise these new coadds for 
precision shear measurements. Our goal is to reduce the 
systematic errors arising from uncorrected PSF anisotropics 
below the statistical errors. We begin by specifying our re- 
quirements in Sec. [21 and describing the data that we use in 
Sec. [3] A description of the coaddition and catalogue-making 
pipeline follows in Section [4] We describe our method for 
estimating two-point functions of star and galaxy shapes in 
Sec. [5] Demonstrations of the data quality and suitability for 
sensitive weak lensing measurements are described in Sec- 
tion [5] We conclude with lessons for future experiments in 
Sec. [3 



2 DESIGN REQUIREMENTS 

Weak lensing measurements on large scales are vulnerable 
to a variety of systematic measurement errors. In order to 
measure cosmic shear on the scales described above, we must 
first have a clear idea of what the possible sources of these 
systematic errors are, and to what level (quantitatively) they 
must be suppressed. This section describes in turn the most 
common generic sources of measurement error relevant for 
weak lensing, and lays out quantitative methods for detect- 
ing their presence in our final catalogue. 
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The PSF0 of the SDSS survey exhibits significant spa- 
tial and temporal variations across the entire survey. We 
model these effects as a spatially-varying convolution kernel 
G. The observed image I(x) at some position x is related 
to the "true" image / by 



J(x)= / f(y)G(x-y)d 2 y, 



(3) 



where G is the convolution kernel appropriate to the region 
of sky under examination. 

One effect of a spatially-varying PSF G is to produce a 
spurious shear field determined by the atmosphere and tele- 
scope that is statistically independent of and superposed 
upon the undistorted galaxy shape pattern. Point sources 
(stars and completely unresolved galaxies - we have no need, 
at present, to distinguish these) sample only the field sourced 
by G, and so can be used to constrain a model for the sys- 
tematics field. Any uncorrected additive shear contribution 
due to the ellipticity of G will produce a correlation between 
the measured galaxy and point-source shapes. This additive 
shear will be statistically uncorrelated with the true cosmic 
shear signal. 

The masking steps of the catalogue construction pro- 
cedure can also produce a significant shape selection bias. 
For the photometric pipeline used here, masked regions are 
defined as sets of pixels; a galaxy is rejected from the cata- 
logue if the set of pixels making up a galaxy intersects the 
set of masked pixels. On the masked region boundary, galax- 
ies aligned across the mask boundary are more likely to be 
rejected from the catalogue than galaxies aligned along it 
producing a spurious shear. This will affect both stars and 
galaxies, but the effect on spurious galaxy ellipticities will 
be much larger than that on stars (as the dispersion in mea- 
sured stellar ellipticities is very small). This mask selection 
bias produces an additive shear, which will also be statisti- 
cally uncorrelated with the true cosmic shear signal. 

These two effects enter together as an additive term in 
the shape clustering statistics, as 



measure 



d(#) — £cosmic(#) -f- £ s; 



tics(#)- 



(4) 



The point-source and galaxy populations have different 
sensitivities to the ellipticity of the PSF, to optical distor- 
tions, and to the geometry of the mask. If these are ac- 
counted for, a measurement of the point source-galaxy cross 
correlation provides a straightforward estimate of the spu- 
rious signal sourced by uncorrected PSF variation^. We will 
require that the amplitude of this spurious correlation in 
our final shape catalogue be sub-dominant to the statisti- 
cal errors - in particular, that the additive PSF systematics 
amplitude be constrained to less than the statistical errors. 

The average ellipticity measured for the gravitationally 
sheared images of a population of galaxies is proportional to 
the applied shear; the exact value of this calibration depends 
on the surface brightness profiles of the galaxies. We will 



6 Here we use the term "PSF" to denote all contributions: the 
atmosphere, optics, tracking errors, charge diffusion, and pixeliza- 
tion. 

7 This statement is true for sufficiently large areas that any 
chance superpositions of PSF ellipticity and the lensing shear 
average out. For this reason, we impose this test on chunks with 
area ^ 25 deg 2 . 



address the shear calibration uncertainties in a companion 
paper. 



3 DATA 

3.1 The Sloan Digital Sky Survey 

York et al.|[2000h and 



The Sloan Digital Sky Survey (SDSS 



its successor SDSS-II (|Frieman et al.l [20081 mapped 10000 



square degrees across the north galactic cap using a ded- 
icated wide-field 2.5 m telescope at Apache Point Obser- 
vatory in Sunspot, New Mexico dGunn et al.l [2006). The 
SDSS camera, described in lGunn et aL (|l998), images the 
sky in five optica l bands (u.a.r.i.z; iFukugita et al.l 1 19961 ; 
I Smith et al.l l2002h with the charge-coupled device (CCD) 
detectors reading out at the sidereal rate. Each patch of 
sky passes in sequence through the five filters (in the order 
r,i,u,g,z) along one of the six columns of mosaicked CCDs, 
and is exposed once in each filter for 54. 1 s. T h e site is mon- 
itored for photometricity (|Hogg et al.l l200ll; iTucker et al.l 
2006). Data undergo quality assessment Ijlvezic et al.l l2004 ). 
and final calibration is done using the "ubercalibration" pro- 
cedure based on photometr y of stars in run overlap regions 
([Padmanabhan et al.ll2008 | ). We use the data from the sev- 
enth SDSS data release l Abazaiian et al]|2009l ). with an up- 
dated calibration from the subsequent data release. 

The footprint of one night's observing is six columns of 
imaging the width of one CCD (13.52 arcmin) separated by 
slightly less than one CCD width (11.65 arcmin). Imaging 
taken during a continuous period of time on one night is col- 
lectively termed a run; each separate column of imaging is, 
sensibly, a camera column (or "camcol"), and the imaging 
along each camera column is chopped for processing pur- 
poses into 8.98 arcmin long frames or fields. Successive runs 
are interleaved, in order to fill in the gaps between camera 
columns. Pairs of interleaved runs along the same great cir- 
cle are stripes (each of which has a north and a south strip) . 



3.2 Stripe 82 

Most of the SDSS imaging data were acquired in the north- 
ern galactic cap, with galactic latitude \b\ > 30. For commis- 
sioning, and during sidereal times when the primary survey 
region was unavailable, the telescope frequently imaged a 
2.5 degree wide stripe of sky along the celestial equator with 
right ascension (RA) in the i nterval -50 < RA < +50°. The 
SDSS-II supernova project dFrieman et al.l 120081 ) observed 
this region many times during the months of September- 
November over the years 2005-2007 to collect multicolour 
light curves of Type la supernovae. In the survey nomen- 
clature, this region is Stripe 82. At any given location on 
the Stripe, there are on average 120 contributing interleaved 
imaging runs, comprising in aggregate almost as much imag- 
ing data as exist in the remainder of the combined SDSS-I 
and SDSS-II footprint. It is here that significant gains can 
be made from image coaddition. 



3.3 Single-epoch data processing 

The raw imaging data is processed by the automate d 
SDSS photometric pipeline, Photo (|Lupton et alj 120011 ). 
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This pipeline has components to handle astrometric and 
photometric calibration as well as catalogue construction; 
it also generates an array of data quality measurements 
describing the telescope point-spread function (PSF), the 
locations of unreliable pixels, and measurements of the 
photometric quality of individual frames. Many of these 
data quality indicators are used during the construction 
of the coadd imaging and its associated catalogue. Their 
use is described below. A detailed description of the im- 
age processing pipelin e and its outputs can be found in 
IStoughton et al.l |2002t) . Outputs can be f ound in locations 
speci fi ed by the SPSS data release papers (|Abazaiian et al 



1x10 



2003, 200 4 I2005I; lAdelman-McCarthv et al 
20od ; lAbazaiian et alj|2009h . 



2006, I200 



Photo produces a number of intermediate outputs for 
the single-epoch SDSS imaging that we use in the coaddition 
process. Corrected Frames, or f pC files, are produced by the 
pipeline from the raw CCD images of single frames; these 
are bias-subtracted and flat-fielded, and a non-linearity cor- 
rection is applied where appropriate. These are the images 
that are combined during the coaddition process below. 

Photo also generates a bitmask (an f pM file) for each 
frame describing pixels that are known to be defective. Pix- 
els are marked in this bitmask as saturated, cosmic-ray con- 
taminated, interpolated (if a column or pixel is known to be 
saturated, or is a priori marked as unreliable, Photo inter- 
polates over that region). We use these bitmasks to exclude 
bad pixels from the image coaddition. 

Astrometric solutions (asTrans files) are produced by 
Astrom for each SDSS frame. Systematic errors in the 
astrometric positioning are reported to less than 50 mas, 
and the relative astrometry bet ween successive overlapping 
frames is approximately 10 mas (Pier et al J2003l ). The astro- 
metric solution for each run (|Pier et al.ll2003l )is determined 
by matching against astrometric standard stars from the 
USN O CCD Astrograph Catalogue fUCAC [Zacharias et ail 
2000) catalogue. The coaddition algorithm relies on the as- 
trometric solutions provided; we have found it unnecessary 
to re-solve the astrometry. 

For photom etric calibration, we use the "ubercal" solu- 
tions derived by iPadmanabhan et al.1 (|2008l ). 

The SDSS pipeline uses bright, isolated stars with ap- 
parent magnitudes brighter than 19.5 to construct a model 
of the PSF and its variation across each frame. For each 
frame, the stellar images for the three neighbouring frames 
along the scan in both directions are used to produce a set 
of Karhu nen-Loeve (KL) eig enimages describing the PSF 
variation l|Lupton et al.l F200L). A global PSF model for the 
frame is constructed by allowing the first few KL compo- 
nents to vary up to second order in the image coordinates 
across the frame, with the coefficients of the variation being 
tied to the aforementioned bright stars. The KL eigenim- 
ages and coefficients of their spatial variation are stored by 
Photo for each band in the psField files. These are taken 
as inputs to the coaddition process and used for PSF cor- 
rection. We will test the fidelity of this PSF model in the 
coadded images on stars that were too faint to perform a 
reliable PSF determination in the single-epoch data. 




12 3 4 

psf size (arcsec) 

Figure 1. The distribution of PSF FWHM in the r band for 
all frames on Stripe 82. The half-width of the target PSF after 
rounding is indicated by the solid vertical line. 



4 ALGORITHMS 

Our general strategy for corr ecting for the effects of see ing is 
similar to that suggested in lBernstein fc Jarvisl l|2002T ). We 
will apply a rounding kernel to each single-epoch image prior 
to stacking the ensemble. The large variation in SDSS PSF 
sizes (see Fig. [1} will require a trade-off between rejection of 
a large fraction of the available imaging, and significant dilu- 
tion of the signal due to the rounding convolution. Stacking 
the images without a kernel, however, will produce a PSF 
with large variations - including steps at run boundaries or 
the edges of regions masked due to e.g. cosmic rays - that 
will be difficult to model accurately. 



4.1 Field smoothing 

This section describes the operation of smoothing the map 
so as to make the effective PSF equal to some target PSF. 
Here we will denote the intrinsic PSF of the telescope by 
G(x), so that if the intrinsic intensity of an object on the 
sky is f(x), the actual image observed is 



I(x) = / G(y).f(x - y)d 2 y = [G ® f](sc) 



(5) 



Of course this image is only sampled at values of x corre- 
sponding to pixel centres. Our principal objective here is to 
construct the kernel K such that 

[I® K](x) = [r® f](x) or [G ® K](x) = F(x), (6) 

where F is the target PSF. In order to do this, we need to first 
choose a target PSF V and then determine the appropriate 
convolution kernel K, which will differ for every imaging run 
contributing to the coadds at a given position depending on 
the full position-dependent PSF model in each run. These 
are the subjects of Sees . 14. PI and [4.1.21 respectively. 
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4.1.1 The target PSF 

Here we consider the target PSF F. It must be constant 
across different runs in order for the co-add procedure to 
make sense, although it need not be the same in different 
filters. There is a large advantage in having F be circu- 
larly symmetric. Gaussians are convenient since most galaxy 
shape measurement codes are based on Gaussian moments, 
but this is not a requirement. In fact the PSF G delivered 
by most telescopes, including the SDSS, has "tails" due to 
the atmosphere at large radius that are far above what one 
could expect from a Gaussian. These can in principle be re- 
moved by a convolution kernel K that has negative tails at 
large radius, but there are problems when these tails extend 
to the field boundaries or across bad columns in the CCD. 
Therefore we have chosen the double- Gaussian form for V: 



r(x) = 



2no 2 



/w e -x 2 /2a= , 



2-ko 2 



2 /2a 



(7) 



with 02 > 01. This functional form manifestly integrates to 
unity, and has a fraction / w of the light in the "large" Gaus- 
sian. The two Gaussians have widths o~\ and 02, respectively, 

with G 1 < 0"2- 

The parameters of the double-Gaussian were adjusted 
by trial and error so that a compactly supported kernel K 
(13 x 13 pixels) can achieve G®K « V for a wide range of real 
PSFs G delivered by the SDSS. The most critical parameter 
is the width of the central Gaussian, o\. This is the main 
parameter controlling the seeing of the final co-added image: 
if it is set too high then many galaxies become unresolved, 
whereas if it is set too low then a large number of fields with 
moderate seeing will have to be rejected because it will be 
impossible to find a kernel K that achieves the target PSF 
without dramatically amplifying the noise. 

The PSF size distribution in the r band is shown in 

Fig. hi 

4-1.2 The convolution kernel and its application 

Equation (|6} can formally be solved in Fourier space by tak- 
ing the ratio, K(k) = T(k)/G(k), where the tilde denotes 
the Fourier transform and k the wave vector. Unfortunately, 
this simple idea comes with two well-known problems. One 
is that if the PSF has power only up to a certain wave num- 
ber fe max , then it is impossible to divide by G(k) — 0. The 
other is that the PSF varies slowly across the field, i.e. G in 
Eq. (0 formally depends on x as well as y. 

The solution to the first problem is that instead of tak- 
ing a simple ratio in Fourier space, we minimise the L 2 norm 
of the error, 



(8) 



£1 = J \T(x) - [G <g> K]{x)\ 2 d 2 x = ||r - G <g> K\\ 2 
subject to a constraint on the L 2 norm of the kernel: 
£a= f \K(x)\ 2 d'x = \\K\f. 



(9) 



If the input noise is white (which is a good approximation), 
then the noise variance on an individual pixel in the con- 
volved image is "£2 times the noise variance in the input 
image. Roughly speaking, for kernels that attempt to "de- 
convolve" the original PSF, and consequently have large pos- 
itive and negative contributions, £2 will come out to be very 



large. We adopt a requirement that £2^1. For kernels that 
poorly approximate the target PSF V, £1 will be very large. 
The problem of minimising £1 subject to a constraint on £2 
is most easily solved by transforming to the Fourier domain 
and then using the method of Lagrange multipliers: 



K{k) = 



G*(fc)f(fc) 
|G(fe)| 2 +A' 



(10) 



Here the positive real number A is the Lagrange multiplier 
and its value is adjusted until £2 = 1. A plays the role of 
regulating the deconvolution; indeed one can see that for 
Fourier modes present in the image, G(k) ^ 0, we have 
lim A ^ 0+ K(k) = f(fc)/G(fc). 

To summarise, Eq. (I10p finds the convolving kernel K 
that makes the final PSF G ® K as close as possible (in the 
least-squares sense) to V without amplifying the noise. The 
kernel is truncated into a 13 x 13 pixel region centred at 
the origin in order to avoid boundary effects and to prevent 
problems such as bad columns, saturated stars, or cosmic 
rays from "leaking" all over the field. We also re-scale the 
resulting kernel to integrate to unity {K(0) = 1) but since 
A is small, typically of order 10 -5 , this has no practical ef- 
fect. Note that since G(x) and T{x) are both real functions, 
it follows that in Fourier space they satisfy the conditions 
G(k) = G*{-k) and f(fe) = f*(-fe), and then Eq. flO) 
guarantees that a similar condition holds for K: the convo- 
lution kernel K(x) is real. 

The second problem - the variation of the PSF across 
the field - is handled by taking the reconstructed PSF on 
a grid of 8 x 6 points separated by 298 pixels (2 arcmin- 
utes) in each direction, and constructing a grid of 48 kernels 
K. The kernels are then interpolated bilinear ly between the 
four nearest grid points, and then the final image F(x) is 
constructed according to 



F(x) = j Kv(y)I(x-y)d 2 x, 



(11) 



where K m is the kernel reconstructed at position x in the 
field. 

The convolution kernel will not capture PSF model fluc- 
tuations on scales below 2 arcminutes. Since the SDSS model 
PSFs are quadratic functions over the chip, features at the 
arcminute scale and smaller are not captured anyway. We 
show below that, even at 6 — 1 arcmin, the remaining PSF 
variations not captured by the kernel are very small com- 
pared to the expected shot-noise errors in the two-point 
statistics at those scales. 

Obviously there will be instances in which the kernel 
reconstruction is not good enough. Therefore a set of cuts 
must be applied to the resulting kernels. In order to con- 
struct these cuts, we consider the Gaussian-weighted mo- 
ments of the residual r — G <£> K, i.e. 



M, 



[r-G®K](x)^e-* ' 2 ^d 2 x. (12) 



The cuts are then: 



1. We reject an entire field if the SDSS software used to 
determine the PSF (the postage stamp pipeline, or psp) 
failed to determine a good PSF model in the single-epoch 
imaging, or was forced to use a low-order fit to the PSF 
(PSP_STATUS!=0). 
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Table 1. Parameters for the PSF repair in different filters. 



Parameter 


u 


9 


r 


i 


z 


Units 


Target PSF parameters 


01 (PSF.SIZE) 


1.80 


1.40 


1.40 


1.40 


1.40 


pixels 


CT2 (PSF_SIZE_WING) 


5.10 


5.10 


5.10 


5.10 


5.10 


pixels 


/„ (FRACWING) 


0.035 


0.035 


0.035 


0.035 


0.035 


pixels 


FWHM of target PSF V 


1.68 


1.31 


1.31 


1.31 


1.31 


arcsec 


50 per cent Encircled Energy Radius 


0.86 


0.67 


0.67 


0.67 


0.67 


arcsec 




Kernel acceptance 


parameters 








CUT.L2 


0.001 


0.0025 


0.0025 


0.0025 


0.0025 




CUT.OFFSET 


0.04 


0.01 


0.01 


0.01 


0.01 




CUT.ELLIP 


0.002 


0.0005 


0.0005 


0.0005 


0.0005 




CUT.SIZE 


0.01 


0.0025 


0.0025 


0.0025 


0.0025 




CUT.PRQF4 


0.04 


0.01 


0.01 


0.01 


0.01 




Co-addition parameters 


DELTA_SKY_MAX 1 


0.5 


0.25 


0.25 


0.25 


0.25 


nmgy arcsec 2 


DELTA_SKY_MAX2 


0.04 


0.02 


0.02 


0.02 


0.02 


nmgy arcsec -2 



2. We reject cases where the PSF residual is too large re- 
gardless of the moments, i.e. 

I|r -,gg g " a > CUT_L2. (13) 
ll r ll 

3. We reject cases where the Gaussian-weighted offset is 
more than CUT.OFFSET ai, i.e. 



M oi + M io > CUT_0FFSET. 



(14) 



4. We reject cases where the ellipticity of the final PSF 
exceeds CUT_ELLIP, i.e. 



x/(M 02 - M 20 ) 2 + (2Mn) 2 > CUT_ELLIP. 



(15) 



5. We reject cases where the PSF size error exceeds 
CUT.SIZE, i.e. 



|M 22 - Moo I > CUT.SIZE. 



(16) 



6. We reject cases where the radial profile of the PSF is 
severely in error as determined by the fourth moment, i.e. 

|M 40 + 2Af 2 2 + M 04 - 2M o| > CUT_PR0F4. (17) 

The specific values of the parameters chosen for the cuts 
depend on the band and are shown in Table [T] The tightest 
constraints on the quality of the PSF are in g, r, i, and z 
bands [r and i are used to measure galaxy shapes) . In the u 
band, where the average image quality is much lower than in 
the other bands, more liberal cuts can be applied because we 
are interested primarily in the total flux, not the shape. Also 
there is more to gain from liberal cuts because the signal-to- 
noise ratio in u band is lower. Nevertheless, a serious error 
in the size of the PSF will result in erroneous photometry, 
and spurious ellipticity could introduce colour/photo- z or 
selection biases that depend on galaxy orientation, so some 
cuts must be applied. 



4.2 Noise symmetrisation 

It is a well-known fact in weak lensing that even if the PSF 
in an image has been corrected to have perfectly circular 



concentric isophotes, it is possible to produce spurious ellip- 
ticity if there is anisotropic correlated noise. For example, if 
the PSF is elongated in the x\ direction and is "fixed" by 
smoothing in the X2-direction, the resulting map has more 
correlations in the xi direction than x\. This can lead to (1) 
centroiding biases, in which the error on the galaxy centroid 
is larger in the £2 than the x\ direction, thus yielding more 
galaxies that appear aligned in the X2 than x\ direction; 
and (2) biases in which noise fluctuations tend to be elon- 
gated in the X2 direction, so that positive noise fluctuations 
on top of a galaxy (which increase its likelihood of detec- 
tion) tend to make it aligned in the X2 direction whereas 
negative fluctuations (which decrease the likelihood of de- 
tection) make the galaxy aligned in the xi direction. For 
a detai l ed de scri ption of noise-induc e d ellip ticity biases, see 
iKaised (|2000l ) or llBernstein fc Jarvisl i|2002l i These phenom- 
ena can all mimic lensing signals and hence should be elim- 
inated from the data. Our method of doing this is to add 
synthetic noise to each field so as to give the noise proper- 
ties 4-fold rotational symmetry. To be precise, we want the 
power spectrum of the total noise (actual plus synthetic) to 
satisfy: 



P N {k) = P N (e 3 x fc), 



(18) 



where e$ is a vector normal to the plane of the image; the 
cross product operation e$ x rotates a vector by 90 degrees. 
Even though it is not circularly symmetric, this is sufficient 
to guarantee zero mean ellipticity for a population of ran- 
domly oriented galaxies because ellipticity reverses sign un- 
der 90 degree rotations[f| In principle m-fold symmetry for 
any integer m ^ 3 would suffice, however 4- fold symmetry is 
the only practical possibility for a camera with square pix- 



8 In group theory language, the noise properties are symmetric 
under the 4-fold rotation group C4, which is a subgroup of the 
full rotations SO(2). The condition for zero mean ellipticity due 
to noise is that ellipticity fall into one of the non-trivial represen- 
tations of the noise symmetry group. 
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els. For obvious reasons, we would like to achieve this by 
adding the minimal amount of synthetic noise possible. 

The simplest way to achieve Eq. (| ISp is to decompose 
the power spectrum into its actual ("act") and synthetic 
("syn") components: 

p N (k) = pjr*\k) + p^ yn) (k). (19) 

The actual component is the white noise variance v in the 
input image, smoothed by the convolving kernel: 

P^ ct) (k)=v\K(k)\ 2 . (20) 

Since K is real, this power spectrum has 2-fold rotational 
symmetry: P^ ct ^(fe) = P^ ct \—k). The minimal synthetic 
noise power spectrum that satisfies Eq. (|18l) is then 

P<T> (k) = max [p^ (e 3 x fe) - P^ ct) (fe), o] . (21) 

Gaussian noise with this spectrum can be obtained by 
taking its square root, 

f (fe) = y/p<T\k), (22) 

and transforming to configuration space T(x). Then one 
generates white noise with unit variance and convolves it 
with T. Since the PSF and hence K varies across the field, 
T must also vary; its value is interpolated from the same 
8x6 grid of reference points as used for K. 

The Gaussian white noise was generated using Numeri- 
cal Recipe s gasdev modified to use the ran2 uniform deviate 
generator ijPress et al.lll992n . The seed was chosen by a for- 
mula based on the run, camcol, field number, and filter to 
guarantee that the same seed was never used twice in the re- 
ductions, and that the same sequence will be generated if the 
software is re-run. For each field, a sequence of 2048 x 1361 
Gaussian deviates is generated; since there are 128 rows of 
overlap between successive fields, we fill in the last 128 rows 
of each field with the first 2048 x 128 deviates from the next 
field. It is also essential that the period of the generator 
be longer than the total number of pixels in the survey (of 
order a few xlO 12 ), a requirement which is not fulfilled by 
many generators, since otherwise the same synthetic "noise" 
pattern will repeat itself throughout the survey. 

The image F(x) after addition of the synthetic noise is 
a klmage. 

4.3 Single-image masking 

Once the kernel-convolved, noise-added image (klmage) is 
constructed for each run that will contribute to the coadds 
at a given position, it is necessary to construct a mask before 
co-addition. The mask must remove the usual image defects 
as well as diffraction spikes. It is constructed as described in 
this section, and is termed the kMask. 

We begin by masking out all pixels in F(x) for which the 
convolution (Eq. llip integrates over a bad pixel. Since K has 
compact support - it is nonzero only in a 13 x 13 pixel region 
- this means that for each bad pixel in I(x) we mask out a 
13 x 13 block in F(x). Our definition of a "bad pixel" is one 
that is out of the field; was interpolated by PHOTO (usually 
due to being in a bad column); is saturated; is potentially 
affected by ghosting (via the f pM ghost flag); was not checked 
for objects by PHOTO; is determined by PHOTO to be affected 
by a cosmic ray; or had a model subtracted from it. Note 



that the first cut means that a 6-pixel region is rejected 
around the edge of the field. 

The second and more sophisticated mask is applied to 
remove diffraction spikes from stars. The secondary sup- 
port structure responsible for the diffraction spikes is on 
an altitude-azimuth mount, so that the diffraction spikes 
appear at position angles of 45, 135, 225, and 315 degrees 
in the altitude-azimuth coordinate system. Therefore in the 
equatorial runs, the orientation of the diffraction spikes rel- 
ative to equatorial coordinates changes depending on the 
hour angle of observation. If no correction for this is made, 
then after co-addition of many runs, even moderately bright 
stars have a hedgehog-like pattern of diffraction spikes at 
many position angles that can affect a significant fraction of 
the area. 

Our procedure for removing diffraction spikes is as fol- 
lows. We first identify objects with a PSF flux (i.e. flux de- 
fined by a fit to the PSF) exceeding some threshold (corre- 
sponding to 9.7 x 10 5 , 8.5 x 10 5 , 2.2 x 10 5 , 1.7 x 10 5 , and 
1.1 x 10 6 electrons in filters r, i, u, z, and g respectively). 
Around these objects, we mask a circle of radius 20 pixels 
(8 arcsec) and four rectangles of width 8 pixels (3 arcsec) 
and length 60 pixels (24 arcsec). The rectangles have the 
object centroid at the centre of their short side, and their 
long axis extends radially from the centroid in the direction 
of the expected diffraction spike. 

4.4 Resampling 

In order to co-add images, we must first resample them into 
a common pixelization. Ideally we would like this pixeliza- 
tion to be both conformal (no local shape distortion) and 
equal-area (convenient for total flux measurements). Unfor- 
tunately because the sky is curved, it is impossible to achieve 
both of these conditions. However since our analysis uses a 
narrow range of declinations around the equator (|<5| ^ 1.3°) 
we can come very close by choosing a cylindrical projection; 
the obvious choices are Mercator (perfectly conformal) or 
Lambert (perfectly equal-area). In our case the Mercator 
projection would result in the pixel scale being different by 
A9/9 = 2.6 x 10~ 4 at the Equator versus at 5 = ±1.3°. 
(The area error is twice this, or 5.2 x 10 -4 .) The Lambert 
projection would preserve shapes at the Equator but the co- 
ordinate system would have a shear of 7 = 2.6 x 10~ 4 at 
5 = ±1.3°. Neither of these problems is particularly serious, 
since either could if necessary be corrected in the flux or 
shape measurements. We have chosen the Mercator projec- 
tion because the small cosmic shear signal means that we 
are much more sensitive to a given percentage error in shear 
than in flux. Also, a flux error of 5.2 x 10~ 4 is insignificant 
compared to the error in the flatfields, so there is no point 
in eliminating it at the expense of complicating the shear 
analysis. 

The scale of the resampled pixels must be smaller than 
the native pixel scale on the CCD (~ 0.396 arcsec) in order 
to preserve information. However it is desirable for it not to 
be too small, since this increases the data volume with no 
increase in information content. It should also not be nearly 
equal to the CCD scale in order to avoid production of a 
moire pattern with large-scale power. We have used 0.36 
arcsec. 

The actual resampling step requires us to interpolate 
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the image from the native pixelization onto the target Mer- 
cator pixelization. This is done by 36-point second-order 
polynomial interpolation on the 6x6 grid of native pixels 
surrounding the target pixefl A target pixel is considered 
masked if any of the 36 surrounding pixels are masked. 

4.5 Addition of images 

After resampling the images, the next step is to combine 
them to produce the co-add. The combination proceeds in 
three steps: comparison of images to reject "bad" regions 
that were not masked in earlier stages of the analysis; rel- 
ative sky estimation; and stacking. Note that bad regions 
must be explicitly rejected: "robust" algorithms such as the 
median are nonlinear and slightly biased, and result in a final 
co-added PSF that depends on object flux and morphology, 
which is not acceptable for lensing studies. 

Rejection of bad regions is critical because it is possi- 
ble for some serious defects such as satellite trails to "leak 
through" earlier stages of the analysis and not be kMasked. 
Rejection at this stage is also the best way to eliminate solar 
system objects, most of which will be known, but which are 
not easily identified in the single-epoch fpCs but of course 
will not show up at the same coordinates in successive runs. 
We first bin each input image into 4x4 resampled pixels. We 
then compare the binned images and reject the brightest or 
faintest image if it differs by more than DELTA_SKY_MAX1 from 
the mean. When this rejection is done, we actually mask a 
20 x 20 resampled pixel region around the affected area. (We 
found that without this padding region, satellite trails were 
often incompletely masked because they passed through the 
corners of some 4x4 regions and did not sufficiently affect 
the mean flux.) 

Next we compute the difference in sky level among 
all of the N images. This difference must be determined 
and removed because otherwise a masked pixel in an im- 
age with below-average sky will appear as a bright spot in 
the co-added image. We compute the relative sky level - an 
often-neglected step in coaddition - as follows. For each pair 
(i, j) of co-added images, we compute the difference map 
Fi — Fj and take the median in 125 x 125 resampled pixel 
blocks. This is taken as an estimate of the sky difference 
Si — Sj. From these differences we obtain the unweighted 
least-squares solution for the sky levels {Si}, up to an ad- 
ditive offset (the absolute sky level cannot be determined 
by this procedure). The mean of these levels is denoted by 
S = J2iLi Si/N. We add to the ith image the quantity S— Si 
interpolated to a particular point x by 4-point interpolation 
from the nearest block centres. An entire block is masked 
out if \S - Si\ > DELTA_SKY_MAX2 and if it is an extremal 
value (either the highest or lowest sky value). 

The stacking of the images works by the usual formula 

Ftot{x) = ^L^M, (23 ) 
wtot as) 



Polynomial interpolation on an equally-spaced grid of points 
converges to sine interpolation in the limit that the number of 
gridpoints is taken to infinity. This is easily seen from the La- 
grange interpolation formula and the infinite product, 
n^=i(l ~~ m/n)(l + x/n) = sui(-kx)/(itx). 



where Wtot(x) = Ysi=i w i{ x ) an d w i are the weights. Be- 
cause the noise is correlated, the optimal weights are scale- 
dependent; we have chosen the optimal weights in the limit 
of small fc, i.e. large scales. That is, Wi = u _1 where v is the 
noise variance in image i. For photometry of large objects, 
w%ot can be thought of as an inverse white noise variance, i.e. 
the mean square noise flux in a region of area Q is l/wtotfi- 
However for small objects (which are always our concern) 
this is not the case and the error bars must be computed 
from the measured noise properties of the co-add. 

An example of a co-added image, and comparison to a 
single-epoch image, is shown in Fig. [2] 

4.6 Additional masking 

Before constructing the photometric catalogues, we zero all 
pixels contamina ted by bright stars in the Tycho catalogues 
|H0g et alj|200ch . replacing them with random noise of ap- 
propriate amplitude. Pixels masked in this manner have the 
'INTERP' bit set in the input fpM files, so that the down- 
stream analysis can exclude objects that incorporate pixels 
from a masked region. Pixels that are kMasked (according to 
one of the above criteria) also have 'INTERP' bits set. This 
final step results in a catalogue with a complex geometry, 
which will be demonstrated explicitly in Sec. 14.91 

4.7 Photometric catalogues 

Once each coadded image is constructed, we detect objects 
using the catalogue-construction portion of the SDSS pho- 
tometric pipeline, photo-frames. The details of frames's 
catalogue construction and obj ect measurement proce ss are 
described more fully elsewhere (|Stoughton et al.l l2002: Lup- 
ton et al., in prep.). It is nevertheless useful to review the 
important parts of the frames algorithms. 

Photo-frames requires as input a set of long inte- 
ger images, and a considerable array of inputs describing 
the properties of the telescope and the observing condi- 
tions. Principal among these is a description of the telescope 
point-spread function. For single-epoch data, frames uses 
a principal-component decomposition of the variation of the 
PSF across five adjacent fields. The components of this de- 
composition are allowed to vary as a polynomial (typically 
quadratic) in the image coordinates across each frame. As 
the coadded images have the same target PSF in every im- 
age, this target PSF is stored as the first principal compo- 
nent. For fast computation of object properties, the pipeline 
uses a double- Gaussian fit to the PSF; as this is the exact 
form of the target PSF resulting from the rounding kernel 
applied above, we simply use the target PSF parameters. 

Frames first smooths the image with the narrower of 
the two Gaussian widths describing the PSF. Collections of 
connected pixels greater than 7 times the standard deviation 
of the sky noise are marked as objects. Each object is grown 
by six pixels in each direction. For each object, the list of 
connected pixels is then culled of peaks less than three times 
the local standard deviation of the sky. 

In order to avoid including objects that represent ran- 
dom noise fluctuations, catalogue galaxies are required to 
have statistically significant (> 7a) detections in both the r 
and i bands. Note that this is a higher threshold than the 
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Figure 2. A comparison of a coadded image (upper panel); its inverse variance map (middle panel); and a single-epoch input map (bottom 
panel). The coadded image is centered on RA 01 h 56 m 34.8 s , Dec -01°10'35" (J2000). East is at top; the image spans 7.7 X 2.4 arcmin. 
The top panel shows the r-band image (units: nmgy arcsec -2 , square root stretch), and the center panel shows the inverse variance 
map (units: nmgy~ 2 arcsec 2 , linear stretch). Note the dark vertical stripes in the inverse variance map produced by bad columns, and 
the square patches due to cosmic ray hits propagating through the masking procedure. The spiral galaxy near the center of the image 
is of magnitude r = 17 .4. The single epoch image is from strip 82S, run 4263, field 310, camcol 1 (acquired on 2003 November 20 at 
airmass 1.21). The image shown is the fpC image from rerun 40 on the Data Archive Server (units: uncalibratcd, linear stretch). The 
same number of pixels are shown, but note that the single-epoch image is at the native pixel scale (0.396 instead of 0.36 arcsec) and 
hence shows a slightly larger area. 
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> 5a cut used at this stage in the standard single-epoch 
SDSS processing. This was necessitated by the fact that the 
pixel noise in the klmages is correlated by the convolution 
process. 

In the standard SDSS pipeline, frames then re-bins the 
image and repeats the search. We choose not to use objects 
found in this manner, as the shape measurements of these 
very low surface brightness galaxies would not be reliable. 

This detection algorithm is repeated in each filter sep- 
arately. Objects detected in multiple bands are merged to 
contain the union of the pixels in each band if they over- 
lap on the sky. The list of peak positions in each band is 
preserved. The centre of the resulting single object is deter- 
mined by the location of the highest peak in the r-band. 
Objects with multiple peaks are deblended: the deblending 
algorithm assigns image flux to each peak in the parent ob- 
jectB 

Once a complete list of deblended peaks (hereafter ob- 
jects) has been constructed, the properties of each peak are 
measured. For the purposes of this paper, the most impor- 
tant outputs are the MODELFLUX and MODELFLUX.IVAR param- 
eteri"! which are determined from the total flux in the best- 
fit (PSF-convolved) galaxy profile in the r band (compar- 
ing the likelihoods for an exponential and a de Vaucouleurs 
model), with the amplitude re-fit separately to each of the 
other bands. This flux measure approximates the true, to- 
tal flux in the r-band, and provides a robust colour mea- 
surement, which is crucial for photometric estimates of the 
object redshift distribution. 

The final crucial output of photo-frames, for lensing 
purposes, is a postage stamp image for every unique object 
detected in the catalogue, except for those objects for which 
the deblender algorithm failed. 

4.8 Lensing Catalogue Construction 

After photo-frames has constructed an object catalogue 
from the coadded images, we attempt to eliminate spurious 
detections, stars, and galaxies that are unsuitable for shape 
measurement. Information from the input mask (fpM) files 
is propagated through to the catalogue, so that objects that 
incorporate bad pixels identified earlier in the pipeline can 
be excluded as needed. Due to the nature of the klmages 
produced by the image coaddition, many of the standard 
SDSS flags will not be used (e.g, by construction, there are 
no saturated pixels). As we describe above, masked regions 
of the klmages are marked as interpolated; objects in the 
photometric catalogue outputs with these bits set are re- 
moved from the catalogue at this stage. Any galaxies on 
which the deblending algorithm failed are also excluded, as 
photo-frames will not generate unique postage stamps for 
these objects. 

photo-frames also attempts to classify objects as 
"stars" or "galaxies" on the basis of the relative fluxes in 



Short descrip t ions o f the SDSS deblending can be found in 
IStoughton et alj J2002I . Sec. 4.4.3) and on the SDSS website 
at http://www.sdss.org/dr7/algorithms/deblend.html A de- 
tailed paper describing the deblender is forthcoming (Lupton 
et al., in prep). 

11 http : //www. sdss3 . org/dr8/algorithms/magnitudes .php 



Table 2. Masking radius as a function of apparent stellar mag- 
nitude. 



Magnitude range Masking radius (arcsec) 

r,i < 12 100 

12 < r,i < 13 70 

13 <r,i < 14 50 

14 <r,i < 15 40 

15 < r,i < 16 30 



the point spread function and galaxy model fits (Lupton 
et al., in prep). Objects that are well-described by a PSF 
are classified as stars; we do not include these objects in the 
shape catalogue, but set them aside as aids for detecting 
systematic errors. 

To minimise these effects, we also match against a list 
of all objects classified as stars in the single-epoch SDSS cat- 
alogue:^ 2 ] with apparent magnitudes in i or r band brighter 
than 15. We remove objects from the catalogue within an 
angular separation of these bright stars that depends on the 
stellar apparent magnitude as described in Table 

In addition to these basic cuts, we cull the following 
objects from the lensing catalogue: 

(i) All objects where the model flux or ellipticity moment 
measurement failed; 

(ii) All objects within 62 pixels of the beginning or end 
of a frame; 

(iii) All objects detected only in the binned images 
(BINNED2 or BINNED4); 

(iv) All objects where a bad pixel was was close to the 
object centre (INTERP .CENTER) in either of the r or i 
bands; 

(v) All objects that are parents of blends (i.e., measured 
again in terms of the individual child objects); 

(vi) Those for which the observed r-band magnitude is 
greater than 23.5, or the i-band magnitude is greater than 
22.5. 

The magnitude cut was applied to "observed" (at the 
top of the atmosphere) rather than Galactic extinction- 
corrected magnitudes. While this leads to a non-uniform 
galaxy number density, it avoids iss ues with the limiting - 
S/N varying with position. Using the lSchlegel et aT 
dust map, with the sta ndard extinction-to-reddening ratios 
jStoughton et alj 1200 j . Table 22), along the occupied 100 
degree length of the stripe, the r band extinction A r has a 
mean value of 0.141 and a standard deviation of 0.065. (The 
i band extinction is lower by a factor of 0.76.) A simple test 
using the COSMOS Mock Catalogue (CMC, iJouvel et all 
2009) and a size culF^I at r e g > 0.47 arcsec predicts that this 
standard deviation should result in a la variation of ±3 per 

12 As our sky coverage is less complete than the single-epoch 
data, we use the single-epoch catalogues in masking so as to re- 
move objects that are in close proximity to a star that is in one 
of our masked regions. 

13 For an r e g of the PSF of 0.67 arcsec and a resolution factor 
cut at i?2 > 0.333, we expect the minimum r e jj of a usable galaxy 
to be 0.67^/0.333/(1 — 0.333) arcsec. This is of course only a very 
rough estimate, but this application of the CMC provides a simple 
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Figure 3. Tangential shear ■yt for galaxies as a function of separa- 
tion 8 from stars, as measured in the single epoch SDSS imaging 
using the shape catalogue from lReves et ahl l l201ll) . The different 
lines with points show results for bins in r-band stellar apparent 
magnitude, as labelled on the plot. The ideal expected value of 
zero is shown as a dotted horizontal line. 



cent in the galaxy density and ±1 per cent in the mean red- 
shift (z). The systematic error introduced by non-uniform 
depth, which should be second order in the amplitude of 
variations, is expected to be negligible for the purposes of 
the SDSS analysis. Note however that this will not be true 
of future projects. 

Many of these cuts are applied in only one band. The 
result of this process is to produce two separate shape cat- 
alogues, one for each of the two shape-measurement bands, 
so there are a small number of galaxies which appear in only 
one of the two catalogues. 

The SDSS photometric pipeline is known to produce 
significant sky proximity effects, wherein the photometric 
properties of objects detected near a bright star are system- 
atically biased. The effect of bright stars on the measured 
tangential shear of nearby galaxies in single-epoch SDSS 
data is shown in Fig. Motivated by the scales of the ef- 
fects seen there, we mask the regions around bright stars 
with a masking radius that depends on the apparent r-band 
(model) magnitude of the stars as given in Table [5] 



4.9 Shape measurement 

Once the final shape catalogue has been constructed, we 
use the re-Gaus s ianiza tion shape measurement method of 
Irlirata fc S eliak (2003j) to generate an ellipticity measure 
for each object. The proce ssing code and scrip t are a mod- 
ification of those used in iMandelbaum et al.l ((2005) . Re- 
Gaussianization is not an especially modern shape measure- 



and fast way to estimate the impact of marginal changes in survey 
parameters. 



ment technique, but we have used it previously on SDSS 
data, it meets our requirements for shear calibration given 
the expected statistical power, and we had well-tested code 
that interfaced to photo-frames outputs at the time of 
initiating the cosmic shear project. Therefore we chose to 
continue using it for this analysis. 



4-9.1 Overview of re-Gaussianization 

The re-Gaussi anization method was an outgrowth of pre- 
vious work bv lBernstein fc Jarvisl (|2002t ). They defined the 
adaptive moments Mi of an image / by finding the Gaussian 
Q[I] that minimises the L 2 norm — {7[/]||. A Gaussian has 
6 parameters - an amplitude, 2 centroids xi, and 3 com- 
ponents of the symmetric covariance matrix - and the last 
of these is by definition the 2x2 adaptive moment matrix. 
The ellipticity of the galaxy is defined via its components 



and 
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For Gaussian PSFs and galaxies, it is easily seen that the 
adaptive moment of the intrinsic galaxy image / can be ex- 
tracted from that of the observed image via M j> = Mj — Mr- 
If the PSF is both circular and Gaussian (a situation that 
does not arise in practise) then one can relate the ellipticity 
of the observed image to that of the true galaxy image via 
the resolution factor R2: 



R2 



and i?2 



(26) 



where we have used T to denote the trace of the adaptive mo- 
ment matrix: e.g., Tr = Mr,xx + Mr,yy Re-Gaussianization 
seeks to apply corrections to Eq. (|26[1 to correct for the non- 
Gaussianity of the PSF and the galaxvF^I 



4-9.2 Non-Gaussian galaxies 

First is the non-Gaussian galaxy correction - i.e. we con- 
sider the case of a Gaussian PSF and no n-Gaussian galaxy. 
Appendix C of iBernstein fc Jarvisl (|2002T > used a Taylor ex- 
pansion method to show that if the galaxy is well-resolved, 
then in this case Eq. ([26} could be corrected by using a 
different formula for the resolution factor, 



R-2, = 1- 



(i + /&>)r r 



where ,832 



is the radial fourth moment, 

${p A -Ap 2 + 2)I{x)g[I]{x)A 2 x 



2 J I(x)g[I](x)d 2 x 



(27) 



(28) 



14 There are also steps in the lHirata &; Se liak (2003) code that 
correct for non-circularity of the PSF. However since the co-add 
code has already circularised the PSF, these portions of the code 
are vestigial and we do not describe them here. 



© 0000 RAS, MNRAS 000, 000-000 



12 E. M. Huff et al. 



where Q[I] is the adaptive Gaussian and the rescaled radius 
p is given by 

p = ^(x-x^-MJ^x-X!). (29) 

This is equivalent to an elliptical version of the n = 4, m = 
polar shapelet (|Refregierll2003l ; iRefregier fc Baconl f2003), 
and we have P22 ~ for a Gaussian galaxy (in practise 
usually > 0). 

4.9.3 Non- Gaussian PSF 

Finally we arrive at the correction for the non-Gaussian 
PSF. We construct a Gaussian approximation Gi to the PSF 

r, 

r(g) « Gi(g) = - 1 exp (~x T Malx). (30) 

27r A /det M Gl V 2 J 

The choice Gi is chosen according to the adaptive moments 
of r. The function Gi is determined from the centroid and 
covariance, but the amplitude in Eq. (|30f) is chosen to nor- 
malise the Gaussian G% to integrate to unitvPI 

We may then define the residual function e(x) = F(a;) — 
Gi(x). It follows that the measured image intensity will sat- 
isfy / = r®/ = Gi<g>/+£®/, where ® represents convo- 
lution. This can be rearranged to yield: 

Gi®/ = l- e®/. (31) 

This equation thus allows us to determine the Gaussian- 
convolved intrinsic galaxy image if we know /. At first 
glance this does not appear helpful, since if we knew / it 
would be trivial to determine T ® /. However, / appears 
in this equation multiplied by (technically, convolved with) 
a small correction e, so equation (|31[1 may be reasonably 
accurate even if we use an approximate form for /. The 
simplest approach is to approximate / as a Gaussian with 
covariance: 

fo = , 1 exp ( - ^a; T [Mf }~ l x ) , with 

2^detM< 0) V 2 J 

= M/ Mr, (32) 

where Mi and Mr are the adaptive covariances of the mea- 
sured object and PSF, respectively. Then we can define: 

/' = /-e®/o(wr®/). (33) 

The adaptive moments of /' can then be computed, and the 
PSF correction of Eq. Q29p can then be applied to recover 
the intrinsic ellipticity 

Simple simulations with (noise-free) toy galaxy profiles 
indicate that this method has shear calibration errors at the 
level of a few percent depending on the galaxy profile, with 
the worst performance for de Vaucouleurs profiles at low 
resolution and high ellipticity l|Mandelbaum et alj|200 j . fig. 
5). Moreover, simulations of SDSS data based on real galaxy 

15 The reason for doing this is that while this increases the overall 
power /(e 2 ) of the residual function, it yields ft = 0, which 
ensures that for well-resolved objects (i.e. objects for which the 
PSF is essentially a 5-function) , the "correction" e <8> /o applied 
by equation (133 6 does not corrupt the image /. 



profiles from COSMOS, single-epoch SDSS PSFs, and re- 
alistic noise levels show that the shear calibration biases 
are not markedly differen t under more realistic conditions 
(Mandelb aum et al.ll201l1 j. An investigation of the shear cal- 
ibration bias for the SDSS cosmic shear sample is presented 
in Paper II. 

To select the galaxy sample used for the final analysis, 
we impose a resolution factor cut at ife > 0.333 in both r 
and i (we will justify this choice in Sec. 16.31 based on our 
desire to minimise additive PSF systematics). The parame- 
ters of the final shape catalogue are shown in Table [3l and 
the survey geometry can be found in Fig. 2] The apparent 
magnitude distribution in each band is shown in Fig. [5] We 
show a comparison with the single-epoch photometry for a 
representative subsample of galaxies in Fig. [U 



5 CORRELATION FUNCTION ESTIMATION 

As stated previously, the primary systematic error of con- 
cern in this paper are additive shear systematics, due to 
PSF ellipticity leaking into the galaxy shapes even after the 
PSF correction is carried out. This concern will drive our 
choice of diagnostics to use on the shape catalogues. There 
are several possible choices for diagnostics that we could use: 

(i) 1-point statistics of the star and galaxy shapes: For 
example, we calculate the mean stellar and galaxy elliptici- 
ties in bins of some chosen size and look for deviations from 
zero, including coherent patterns. We use this diagnostic in 
Sec. ET] 

(ii) The tangential shear as a function of scale around 
random points (e.g., iMandelbaum et al.ll2005T ): If there is 
some additive systematic shear, then on scales that are such 
that we start losing lens-source pairs off the survey edge, it 
will show up as a nonzero tangential shear. However, this 
test alone does not tell us much about the correlations be- 
tween systematic shears at different points, and therefore we 
ignore it in favour of more informative tests. 

(iii) Cross-correlations between the stellar shapes and 
galaxy shapes, as a function of separation 9: These corre- 
lation functions tell us not only about the amplitude of any 
systematic shear, but also about the characteristic scales 
that are affected by it. This section will describe our method- 
ology for calculating these correlation functions. 

(iv) The B-mode shear, which should be zero due to grav- 
itational lensing: While this test is an important one as it 
can signal a variety of problems with PSF correction, it is 
not strictly a measure of additive shear systematics. Thus, 
we leave this test for Paper II, which presents the cosmic 
shear analysis. 

5.1 The estimator and weighting 

In order to compute the star-galaxy cross-correlations, we 
employ a direct pair-count correlation function code. It is 
slow (~ 3 hours for 2 x 10 6 galaxies on a modern laptop) 
but robust and well-adapted to the Stripe 82 survey geom- 
etry. The code sorts the galaxies in order of increasing right 
ascension a; stars and galaxies galaxies are assigned to the 
range —60° < a < +60° to avoid unphysical edge effects 
near a = 0. It then loops over all pairs with [ai — Q2I < #max- 
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Figure 4. The angular distribution (in J2000 right ascension and declination) of the i-band galaxy catalogue. A subsample of every 
250th galaxy is shown. The r-band sample is identical except for the missing range of —00° 48' <Dec< —00° 24'. Note the complex survey 
geometry. Coverage gaps at Dec > 0.8 are primarily due to the severe PSF quality cuts made during the image coaddition step. 



Table 3. Parameters of the shape catalogue. 



Parameter 


Value 


Units 




r-band 


i-band 




Total number of source galaxies 


1328 885 




Number of sources per band 


1067 031 


1251285 




Effective number of sources downweighted by noise, 7V e ff = tzji 


882 345 


1 065 807 




Median magnitude 


21.5 


20.9 


mag AB 


Median resolution factor R2 


0.55 


0.53 




RMS measured ellipticity per component (noise not subtracted) 


0.48 


0.47 





The usual ellipticity correlation functions can be computed 
via summation over galaxies i and stars j, e.g., 



;ii, P sf 



(34) 



a/3 ' 



and similarly for ^22, ps f • Here en is the PSF-corrected galaxy 
ei for galaxy index i, and M_Elj is the stellar ei derived from 
the adaptive moments described in Sec. 14.91 The sum is over 
pairs with separation in the relevant 9 bin, and we weight 
each pair according only to the weight associated with the 
galaxy in each pair: 



(35) 



0.37 2 + al 



Following iReves et all (|201ll ). we have for weighting pur- 
poses adopted an intrinsic shape noise e rms per component 
of 0.37. The weight of a galaxy relative to a galaxy with 



perfectly measured shape is 



w(a e = 0) 1 + crl/0.37 2 ' 



(36) 



Since the imaging is taken in drift-scan mode, which in- 
troduces a potential preferred direction for PSF distortions, 
we compute our diagnostic correlations between the compo- 
nents aligned along (— e\ and — M_E1) and at 45 degrees to 
{ei and M_e2) the scan direction. 

The code works on a fiat sky, i.e. equatorial coordinates 
(a, S) are approximated as Cartesian coordinates. This is 
appropriate in the range considered, \8\ < 1.274°, where the 
maximum distance distortions are !<5„ lax = 2.5 x 10 -4 . 

All of our shape correlations are computed over the 
range 1 < 6 < 120 arcminutes, evenly spaced in log#. 
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Figure 5. The distribution of observed (not corrected for Galactic extinction) apparent galaxy model magnitudes in the u, g, r, i, and 
2 bands (top left, middle left, top right, middle right, and bottom panels). In all cases, the solid line shows the apparent magnitudes for 
all unique extended objects; dotted and dashed show the r- and i-band lensing catalogues, respectively. 
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Figure 6. The comparison of the observed (not corrected for Galactic extinction) model magnitudes of galaxies in the coadd lensing 
catalogue with magnitudes for the same objects in the best run at that position in the single epoch imaging. Contours are 68 and 95 per 
cent of the total matches. The asymmetry around the 1:1 line at faint magnitudes is due to the flux limit in the single epoch images. 
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5.2 Statistical errors 

The direct pair-count correlation function code can directly 
compute the Poisson error bars, i.e. the error bars neglecting 
the correlations in ei a KJEaj between different pairs. This 
estimate of the error bar is 

a ^ ++m = E^lW . (37) 

Equivalently this is the variance in the correlation function 
that one would estimate if one randomly re-oriented all of 
the galaxies. As the star-galaxy correlations described here 
are approximate indicators of the amplitude of the additive 
PSF shear, and not precision estimates for use in a cosmic 
shear analysis, we will not attempt to infer the covariance 
matrix for the full diagonal star-galaxy cross-correlation 
functions. 



6 DIAGNOSTICS 

Here we present our two main systematics tests described in 
Sec. O namely the 1-point statistics of the stellar and galaxy 
ellipticities, and the star-galaxy shape cross-correlations. In 
order to do this calculation, we must define a star cata- 
logue, which relies on the Photo star-galaxy separation. 
The colours of the objects selected as stars by Photo are 
shown in Fig. [7] As shown, they agree with previous de- 
terminations o f the c olours of the stellar locus, e.g., from 
iRichards et al. I <|2002T ). 

6.1 Average shapes 

We first estimate the influence of residual PSF ellipticities 
on the galaxy shapes by mapping the stellar shape field. 

We computed a set of star shapes binned by right ascen- 
sion and declination. The stars were chosen to be moderately 
faint, 19.5 < r < 21.5, such that they were not used to es- 
timate the PSF model in the single-epoch images that was 
used to construct the rounding kernel applied to each single 
epoch image. Figure [8] shows the results: the mean stellar 
ellipticities are usually small, of order 10 -3 , but in the r 
band in a particular declination range covered by camcol 2, 
the shapes are systematically elongated in the scan direc- 
tion by — ei = 0.005. We find no significant changes in the 
amplitude of this artifact when splitting the stellar popu- 
lations by colour (r — i < or > 0.3) or by apparent mag- 
nitude (r < or > 20.5). We did not definitively determine 
the source of this elongation, but we have confirmed that 
it appears in the single-epoch SPSS imaging (including the 
galaxy shape cata logues from iMandelbaum et al.l 120051 and 
iReves et al.ll201lh . so is not merely an artifact of the coad- 
dition and catalogue-making process of this worlj^l. There 
is no counterpart feature in the i-band. Given the fact that 
this feature may plausibly arise due to problems with the 

6 One possible explanation is incorrect non-linearity corrections 
for the r-band camcol 2 CCD. The stars used to construct the 
PSF model are sufficiently bright that they require non-linearity 
corrections, but the stars used for our tests here do not. Therefore 
if the non-linearity correction is wrong for that CCD, it could 
affect the PSF model for that CCD alone. 



2 




1 r ■ , ■ ■ 1 
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Figure 7. Density contour plots in colour-colour space for objects 
identified as stars using Photo's star-galaxy separation based 
on the concentration of the light profile; the contours containing 
68 a nd 95 per cent o f the density are shown. The stellar locus 
from IRichards et al.l <|2002h is shown as a solid line. This plot 
includes correction for Galactic extinction, for fair comparison 
with previous results. 

single-epoch PSF model used to determine the proper con- 
volution kernel to achieve the desired coadd PSF, we exclude 
all r-band galaxy data in camera column 2 from the cosmic 
shear analysis. 

6.2 Star-galaxy cross-correlation 

Our primary tasks in producing a shear measurement are 
to demonstrate that the additive systematic shear is below 
the target threshold set above (Sec. [2} , and that our shape 
measurement method allows us to correctly translate the 
measured shapes into shears with sufficient accuracy (a task 
that we will handle in more detail in Paper II). 

In order to test for residual additive shear systemat- 
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Figure 8. The mean cllipticities of stars in the r band as a function of declination for different ranges of right ascension, as indicated at 
the upper right. The top panels show the r band and the bottom panels show the i band, while the left and right panels show different 
cllipticity components. This was computed using a version of the star catalogue prior to final cuts. Note the spurious effect in camcol 2 
r band in the e\ component (declinations —0.8 to —0.4°). The apparent magnitude range for this plot was 19.5 < r < 21.5. 



ics, we calculate the cross-correlation between the measured 
shapes of the stars and those of the galaxies in our sam- 
ple. Any remaining contribution to the inferred shear field 
of the galaxies that is sourced by the point-spread function 
will produce a non-zero cross-correlation. 

We estimate the star-star and star-galaxy cross corre- 
lations as in Eq. ()34p for all star-galaxy pairs within and 
between the r and i bands. The results for the star-galaxy 
correlations are shown in Fig [9] For the systematic error 
diagnostics considered here, we are primarily interested in 
computing the cross-correlation between resolved galaxies 
and unresolved point sources. 



6.3 Resolution cuts 

Due to the PSF dilution correction applied to all galaxy 
shapes in Sec. 14.91 noisy measurements of poorly resolved 
galaxies can significantly amplify any residual additive shear 
systematics not corrected for in the rounding kernel process. 
To assess the effects of a resolution cut, we compute the 
star-galaxy cross-correlations in each band for R2 > 0.25, 
> 0.333, and > 0.4. Adopting the second of these of these 
thresholds appears to be sufficient to minimise the ampli- 
tude of the star-galaxy shape correlation signal. As a result, 
we adopt a cut of R2 > 0.333 for both the i and r-band 
galaxy catalogues. 



6.4 Star-galaxy separation 

6.4-1 Contamination of star sample by galaxies 

A nonzero amplitude of £ S9 can also be produced by im- 
perfect star-galaxy separation. Poorly-resolved galaxies mas- 
querading as stars sample both the PSF- and cosmic shear- 
sourced shape fields. If the fraction of stars that are actually 
mistakenly classified as galaxies is / ga i, then the measured 
£ S9 will include a contribution proportional to f ga l£-y- As 
the ellipticity of nearly-unresolved galaxies will be diluted 
by PSF convolution, this represents an upper limit to the 
level of star-galaxy correlation that can be introduced via 
imperfect star-galaxy separation. 

The photo-frames pipeline classifies an object as a 
star or a galaxy on the basis of the relative fluxes of PSF 
and galaxy model fits to the object's surface brightness 
profile. We have already confirmed that we get a reason- 
able stellar locus from this determination, compared with 
that from single-epoch imaging (Fig. [7]). As another check 
on this scheme, we have defined a sample of stars for 
which aperture-matched UKIRT Infrared Deep Sky Survey 
(UKID DS) colours are available . The UKIDSS project is de- 
fined in lLawrence et al l (|2007r i. UKIDSS uses the U KIRT 
Wide Field Camera (WFCAM; ICasali et all 120071). Th e 
photometric system is described in Hewett et al, (2006), 



and the calibration is described in Hodgk in et al 



2009) 
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Figure 9. The cross-correlation of star shapes with galaxy shapes, for the following pairs of bands: (i, i) in the upper left, (r, r) in the 
upper right, (r,i) in the bottom left, and (i,r) in the bottom right panels. All results are shown as 10 4 6§. The (ej e±) correlation is the 
solid line, while the (e2 e2) correlation is the dashed line. The dot-dashed line shows the expected cosmic shear (e^e^) shape-shape 
correlation for a survey of this depth and size, with shot-noise errors. The triple dot-dashed lines shows the ideal value of zero for the 
star-galaxy correlations. Statistics shown are for stars with apparent i and r band magnitudes between 19.5 and 21.5. 



T he pipeline processin g and science archive are described 
in lHamblv et al.l (|2008l ). Stars and gal axies separate fairl y 
cleanly in J — K,r — i colour space (e.g-. lBaldrv et aTl feoiCO. 
so we attempt to use a matched catalogue from Bundy et al. 
in prep, to put some limits on galactic contamination of the 
stellar sample (see Fig. I10|) . This constraint on / ga i will give 
us our upper limit /gai^T on the £ S9 due to contamination of 
the star sample by galaxies. 

We match the objects classified as stars in both bands 
from our coadd to UKIDSS objects with valid J — K colours; 
objects with angular separations between the two catalogues 
less than one arcsecond are identified. We find 93 753 such 
stars (as classified by Photo). Of these, 11331, or 12 per 
cent, have J — K,r — i colours inconsistent with the stellar 
population. The UKIDSS matches are shallower than the 



rest of the catalogue in the i band, but of comparable depth 
in the r band. Only 16 per cent of our stars have UKIDSS 
matches in either band, however, so the contamination frac- 
tion is not well-constrained in the entire star sample. 

If, however, this fraction is representative of the galaxy 
contamination in the entire stellar catalogue, then for an 
unresolved population with a typical resolution just below 
our resolution cut, that level of contamination would ex- 
plain a substantial fraction of the residual PSF systematic 
amplitude that we see. 

As a test for this, we compute the star-galaxy shape 
correlation using only those objects identified as stars in the 
manner described above. The results are shown in Fig. 1111 
As shown, for this population, the amplitude of the star- 
galaxy correlation is significantly reduced below the star- 
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Figure 10. The cut in (J — K, r — i) space, denned using 
extinction-corrected magnitudes, that was used to separate stars 
from galaxies using the UKIDSS data. Objects below the curve 
(i.e. blue in J — K) are colour-classified as stars, while those above 
the curve are colour-classified as galaxies. 



galaxy correlations. This is suggestive that some of the star- 
galaxy signal may arise from galaxy contamination of the 
star sample. However, because the UKIDSS data does not 
cover the entire footprint of Stripe 82, this test is not con- 
clusive. 

After all of the above cuts have been applied, the final 
shape catalogue consists of 1067 031 r-band and 1251285 
i-band shape measurements, over an effective area of 140 
and 168 square degrees, respectively. 

6.4-2 Contamination of galaxy sample by stars 

The other type of contamination, that of the galaxy sample 
by stars, will tend to dilute lensing statistics measured using 
our catalogue. Because we wish to understand the contam- 
ination in a representative sample of our galaxies (not just 
the ones bright enough to have a match in the UKIDSS cat- 
alogue), we use a different strategy to estimate this type of 
contamination. 

The targeting photometry used for the DEEP2 survey 
comes from the Canada- France-Hawaii Telescope (CFHT), 
and in the two DEEP2 fields on Stripe 82, the typical seeing 
is 0.7-0.8 arcsec, nearly a factor of two better than in our 



coadds |Coil et al.l 12004;) . The catalogues from this imaging 
were publicly released 17 ! as part of DEEP2 Data Release 1. 
We use the star versus galaxy classification for galaxies in 
the coadds that have detections in the DEEP2 targeting 
photometry as a way to estimate the stellar contamination 
in our catalogues in those fields. 

We first match our galaxy catalogue against the DEEP2 



targeting photometry, finding matches for 96 per cent of our 
galaxies. We then eliminate those that are marked as bad 
data or saturated in the DEEP2 catalogues. For the remain- 
ing objects, the star /galaxy separation works as follows (see 
ICoil et al.ll200i for more details): clearly extended objects 
are flagged as such, and we consider those as secure galaxy 
detections. Compact objects are assigned a quantity 'pgal' 
in the range [0, 1], representing the probability that the ob- 
ject is a galaxy based on its colour and magnitude. To es- 
timate the total stellar contamination, we sum the values 
of (1 — pgal) for all of our galaxies that matched against 
compact objects, and compare that to the total number of 
compact and extended matches. The result is a stellar con- 
tamination of 1.7 per cent for both the r and the i band 
lensing catalogues. 

One issue in "calibrating" the stellar contamination 
analysis is that the stellar density varies along the stripe. 
The Galactic latitudes of the two DEEP2 fields are -54° 
and -56°, but the "start" of our stripe (RA 310°, Dec 0°) 
is at I — 46°, b = —24°. These low-RA fields are not only 
at lower Galactic latitude, but also look inward toward the 
bulge. As a simple test for this, we compared the ratio of 
the 19.5 < i < 21.5 star density to the source galaxy den- 
sity in the DEEP2 fields (0.16) versus the stripe as a whole 
(0.46). The ratio of these suggests that the DEEP2 field star 
abundances should be scaled up by a factor of 2.8 to be rep- 
resentative of Stripe 82 as a whole£j A potential issue in this 
method of rescaling is that the stars of different magnitude 
need not be distributed in the same way. To test for this, we 
repeated the above computation for stars at fainter magni- 
tudes, with 22 < r < 22.5 and found a rescaling factor of 1.4. 
This suggests that the larger factor is conservative, but it is 
also possible that the stellar density is being homogenized 
by galaxy contamination of the stars. The true recalibration 
factor for DEEP2 is probably greater than unity, but not 
larger than 2.8. 

The statistical error on this contamination is ~ 10 per 
cent (Poisson error); the systematic uncertainty in how it 
applies to a real lensing analysis, given the strong variation 
in stellar density across the stripe, is far larger. We therefore 
address the issue of real corrections for a lensing analysis in 
Paper II. 



7 DISCUSSION 

We have constructed deep, lensing-optimised, coadded imag- 
ing of the SDSS equatorial stripe. The procedure is de- 
signed to enable the construction of a catalogue suitable 
for weak lensing measurements by suppressing the effects of 
PSF anisotropy on the measured galaxy shapes below the 
level of statistical error achievable with a cosmic shear sur- 
vey on this Stripe. 

The galaxy density of ~ 2 per arcmin 2 is relatively low 
for a cosmic shear survey. However, it makes sense given our 
depth limits and large PSF of the SDSS, even by ground- 
based standards. A s a simple point of comparison, the CMC 
l|jouvel et al.l [2009:') is commonly used to forecast galaxy 
yields for dark energy investigations. The effective radius 
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Figure 11. The cross-correlation of UKIDSS-selected star shapes with galaxy shapes, for the following pairs of bands: (i, i) in the upper 
left, (r, r) in the upper right, (r, i) in the bottom left, and (i, r) in the bottom right panels. All results are shown as 1O 4 0£. The (ei ei) 
correlation is the solid line, while the (e2 e.'i) correlation is the dashed line. The dot-dashed line shows the expected cosmic shear (e+ e+) 
shape-shape correlation for a survey of this depth and size, with shot-noise errors. The triple dot-dashed lines shows the ideal value of 
zero for the star-galaxy correlations. 



of the coadded PSF is 0.67 arcsec; for Gaussians one would 
then expect that our cut on R2 > 0.333 should correspond to 
a cut on effective radius of r e g > 0.67^/0.333/(1 — 0.333) = 
0.47 arcsec. Using the 2011 August 15 update of the CMC, 
and imposing this cut as well as r < 23.5 and i < 22.5 (ob- 
server frame at A r = 0.141), we forecast a galaxy density 
of n = 2.7 arcmin -2 and a mean source redshift (z) = 0.51, 
before any small-scale masking due to e.g. bright stars and 
bad columns. Therefore the final galaxy yield is broadly con- 
sistent with the tools being used to design next-generation 
surveys. 

This procedure is successful if and only if it renders the 
PSF shape distortions sufficiently small that they are negli- 
gible compared to the statistical errors expected for a cosmic 
shear signal in this survey. To estimate the amplitude and 



scale-dependence of the residual PSF systematics, we have 
measured the star-galaxy and star-star ellipticity correlation 
functions in our catalogue. We now fit a power law of the 
form: 

£ sg = A0- p (38) 

to the average of the four measured star-galaxy cross- 
correlations, using the Poisson errors output by the cor- 
relation function code. The best-fitting power law and av- 
erage star-galaxy correlations are shown in Fig. 1121 with 
(A,p) = (1.4 x 10" 5 ,0.85). 

We compare the ratio of this best-fitting power law 
to the shot-noise errors expected for a galaxy shape auto- 
correlation fun ction for this su r vey. T o estimate the shot 
noise, we follow [Schneider et alj ((2002) to calculate the sta- 
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Figure 12. Average of the ri, ir, rr, and ii star-galaxy cross- 
correlation functions, and the best-fit power-law from Eq. 1381 1. 



tistical errors expected due to shot noise for a 168 square 
degree lensing survey with an effective source surface den- 
sity of 2 galaxies per arcmin 2 : 



Var(C) 



(3.979 x 10" 



"off 



0.3 



30 arcmin 



Area 
ldeg 2 



1 arcmin 



(39) 



The ratio of the systematics amplitude to the shot noise 
is shown as a function of scale in Fig. 1131 From this, we can 
see that PSF systematics for these data should be, on aver- 
age, 50 per cent of the size of the statistical error budget for 
a cosmic shear measurement with this catalogue; on degree 
scales, this becomes comparable to the shot-noise errors. As 
discussed above, this is an upper limit for three reasons: (1) 
imperfect star-galaxy separation at the level of several to 
ten per cent can produce a star-galaxy correlation signal in 
the absence of uncorrected PSF effect; (2) the response of 
a galaxy shape to a PSF anisotropy is typically less than 
unity; and (3) the Poisson error estimate will underestimate 
the true variance of the signal on larger scales, where cosmic 
variance becomes important. 

In addition, masks defined as sets of pixels can intro- 
duce a shape selection bias. We tested the effects of masking 
on the spurious shear statistics during the catalogue-making 
step by applying a strict cut to eliminate those regions of the 
coadd imaging with fewer than seven contributing single- 
epoch images. Introducing this cut actually increased the 
spurious shear amplitude; the star-galaxy correlations in the 
presence of this more aggressive masking step reach an am- 
plitude of 10~ 5 at degree scales. 

The relative contributions of mask selection and PSF 
anisotropy biases can be ascertained from the relative ampli- 
tudes of the star-star and star-galaxy correlation functions. 
A PSF anisotropy will produce a similar signal in both met- 



Figure 13. Ratio of the best-fit star-galaxy cross-correlation 
power law to the expected shot-noise errors for a cosmic shear 
measurement using the catalogues described here. As the star- 
galaxy amplitude is only poorly constrained, this should be taken 
as a rough indication of the level of significance of the systematics. 



rics. The stellar shape dispersion and typical stellar size is 
much smaller than that of the galaxies, so a selection bias 
will produce a much larger systematics signal in the star- 
galaxy correlation function than in the star-star correlation 
functions. This is indeed the case, as shown in Fig. [14] - 
substantial evidence that mask selection bias will be a sig- 
nificant fraction of the systematic error budget. Excluding 
objects near the boundaries of masked regions on the basis 
of their centroid positions could remove this effect; however, 
as Fig. [13] shows, the statistical errors should dominate for 
this catalogue, so reducing the catalogue further at this stage 
would not improve the quality of a final cosmic shear mea- 
surement. We also considered trying to simulate and sub- 
tract the masking bias. Ultimately, however, we settled on 
a more empirical approach: as described in detail in Paper 
II, the galaxy shear autocorrelation function used in the fi- 
nal analysis determines and subtracts the mean ei in each 
declination bin. 

These results suggest that a cosmic shear analysis that 
is statistics-limited is possible with these data. We have 
shown that the effects of the point-spread function are small 
compared to the statistical errors. The mask selection bias is 
larger, but still on average significantly smaller than the ex- 
pected statistical errors. A full analysis involving the source 
redshift distribution, shear calibration, and the cosmologi- 
cal implications of the two-point statistics of these data will 
follow in Paper II. 

The systematics floor for the rounding kernel method 
we have employed here is set by the SDSS PSF model. In- 
accuracies in this PSF model are docume nted both here 
(Fig. [S]) and in other work (|Reves et al.ll201lT ). Coherent vari- 
ations in the PSF model errors in both components across 
the camera columns are visible with a characteristic ampli- 
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Figure 14. The correlation functions of star shapes in the following pairs of bands: (r, r) in top left, in top right, and (r, i) in 

bottom left panel. All results are shown as 10 4 $f;. The (ei e\) correlation is the solid line, while the (e2 ei) correlation is the dashed line. 
The dot-dashed line shows the expected cosmic shear (e+ e+) shape-shape correlation for a survey of this depth and size, with shot-noise 
errors. The lower right panel shows the mean stellar cross-correlation signal as a fraction of the expected Poisson error for a cosmic shear 
measurement using this catalogue The triple dot-dashed lines shows the ideal value of zero for the star auto-correlations. 



tude of 2 x 1CP 3 . Aside from the very striking and atypi- 
cal effect seen in the r band in camcol 2, it is likely that 
the shortcomings of the polynomial interpolation method 
employed in Photo play an important role here, as doc- 
umented in iBerge et al.l (|201ll ) for more general simulated 
ground-based data. As this is close to the level of residual 
PSF systematics seen in our final lensing catalogue, it is 
very likely that an improvement in the underlying model 
construction would allow the rounding kernel method de- 
ployed here to achieve a greater level of systematics control. 

The masking problem that we have identified is not ex- 
tensively treated in the literature; to the knowledge of the 
authors, it has not been taken into account in existing stud- 
ies. It is standard in modern photometric pipelines to define 
the survey mask and object rejection algorithms in terms of 



sets of pixels, rather than (for example) galaxy centroids, 
which is the ultimate source of the masking bias we see 
here. This effect will be important to take into account in 
the photometric pipeline construction in the next generation 
of lensing measurements. If possible, masking-related biases 
(and more generally, survey uniformity) should also be ad- 
dressed at the observing strategy level. In this regard, the 
SDSS Stripe 82 technique of scanning the sky along the same 
guiding great circle many times, while appropriate for super- 
novae or transient searches, was highly non-optimal from the 
perspective of producing a uniform quality co-added image, 
since bad columns and other defects always occur at the 
same positions. Even dithering successive runs in the cross- 
scan (declination) direction by of order 10 arcsec would have 
helped this project enormously. 
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In Paper II, we will use the catalogue described here to 
measure cosmic shear. While this work was underway, we 
learned of a parallel effort by Lin et al. (2011). These two 
efforts use different methods of coaddition and different sets 
of cuts for the input images and galaxies; what they have 
in common is their use of SDSS data (not necessarily the 
same set of runs) and their use of the SDSS Photo pipeline 
for the initial reduction of the single epoch data and the 
final reduction of the coadded data (however, they use dif- 
ferent versions of Photo). Using these different methods, 
both groups have attempted to extract the cosmic shear 
signal and its cosmological interpretations. We have coor- 
dinated submission with them but have not consulted their 
results prior to this, so these two analysis efforts are com- 
pletely independent, representing an extreme version of two 
independent pipelines. 

The PSF correction method described here is suitable 
for deployment in the next generation of weak lensing sur- 
veys, or generally any survey with many fully sampled im- 
ages of the same region[]3 All of these surveys will include 
multi-epoch data over their full footprint, fn many cases the 
image quality (as measured by PSF isotropy and size) distri- 
bution may be fairly narrow, since the "shape measurement" 
bands will be acquired in the best seeing conditions. In these 
cases, the rounding kernel method will result in little loss of 
information. 
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